Multifractal characterisation of complete genomes

Authors

  • Vo Anh
  • Ka-Sing Lau
  • Zu-Guo Yu
Abstract

This paper develops a theory for characterisation of DNA sequences based on their measure representation. The measures are shown to be random cascades generated by an infinitely divisible distribution. This probability distribution is uniquely determined by the exponent function in the multifractal theory of random cascades. Curve fitting to a large number of complete genomes of bacteria indicates that the Gamma density function provides an excellent fit to the exponent function, and hence to the probability distribution of the complete genomes.
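As a rough illustration of what a measure representation and its exponent function look like in practice, the sketch below builds a K-string frequency measure from a DNA string and computes a single-scale mass exponent from its partition sum. This is a minimal sketch under stated assumptions, not the authors' procedure: the base ordering A, C, G, T, the function names `kstring_measure` and `mass_exponent`, and the single-scale estimate tau(q) = log Z(q) / log(eps) are all choices made here for illustration, and the exponent function used in the paper's random-cascade theory may be defined differently.

```python
import math
from collections import Counter

# Assumed base ordering (A=0, C=1, G=2, T=3); an illustrative choice.
ALPHABET = "ACGT"


def kstring_measure(seq, k):
    """Return 4**k relative frequencies of the K-strings found in seq.

    Cell j holds the frequency of the K-string whose base-4 value is j,
    so the list can be read as a probability measure on 4**k equal
    subintervals of [0, 1).
    """
    index = {base: i for i, base in enumerate(ALPHABET)}
    counts = Counter()
    total = 0
    for start in range(len(seq) - k + 1):
        word = seq[start:start + k]
        # Skip windows containing symbols outside the assumed alphabet.
        if any(base not in index for base in word):
            continue
        position = 0
        for base in word:
            position = position * 4 + index[base]
        counts[position] += 1
        total += 1
    if total == 0:
        raise ValueError("sequence contains no valid K-strings")
    return [counts[j] / total for j in range(4 ** k)]


def mass_exponent(measure, q):
    """Single-scale estimate tau(q) = log Z(q) / log(eps).

    Z(q) is the partition sum of mu_i**q over non-empty cells and
    eps = 1 / len(measure) is the cell width.
    """
    eps = 1.0 / len(measure)
    partition_sum = sum(mu ** q for mu in measure if mu > 0)
    return math.log(partition_sum) / math.log(eps)


if __name__ == "__main__":
    toy_sequence = "ACGT" * 50  # hypothetical toy input, not genome data
    mu = kstring_measure(toy_sequence, 2)
    for q in (0.5, 2.0, 3.0):
        print(q, mass_exponent(mu, q))
```

For a uniform measure this estimate reduces to tau(q) = q - 1, which gives a quick sanity check. A real analysis would estimate the exponent by regression over several scales before any curve fitting.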


Similar resources

Multifractal characterization of complete genomes

This paper develops a theory for characterization of DNA sequences based on their measure representation. The measures are shown to be random cascades generated by an infinitely divisible distribution. This probability distribution is uniquely determined by the exponent function in the multifractal theory of random cascades. Curve fitting to a large number of complete genomes of bacteria indica...


Multifractal and correlation analyses of protein sequences from complete genomes.

A measure representation of protein sequences similar to the measure representation of DNA sequences proposed in our previous paper [Yu et al., Phys. Rev. E 64, 031903 (2001)] and another induced measure are introduced. Multifractal analysis is then performed on these two kinds of measures of a large number of protein sequences derived from corresponding complete genomes. From the values of the...


Measure representation and multifractal analysis of complete genomes.

This paper introduces the notion of measure representation of DNA sequences. Spectral analysis and multifractal analysis are then performed on the measure representations of a large number of complete genomes. The main aim of this paper is to discuss the multifractal property of the measure representation and the classification of bacteria. From the measure representations and the values of the...


O-44: Characterisation of Monotreme Caseins Reveals Lineage Specific Expansion of an Ancestral Casein Locus in Mammals

Background: One important reproductive characteristic of mammals is the production of milk to nurse the neonate. To better understand the evolution of milk, we have investigated gene expression in milk cells from monotremes, which are the most ancient representatives of the mammalian lineage. Materials and Methods: Using a milk cell cDNA sequencing approach we characterise milk protein se...


Multifractal characterisation of length sequences of coding and noncoding segments in a complete genome

The coding and noncoding length sequences constructed from a complete genome are characterised by multifractal analysis. The dimension spectrum Dq and its derivative, the ‘analogous’ specific heat Cq, are calculated for the coding and noncoding length sequences of bacteria, where q is the moment order of the partition sum of the sequences. From the shape of the Dq and Cq curves, it is seen that...




Publication date: 2001